Corpus: nld_news_2010_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 94 99 99 99 99
1000 835 986 997 999 999
10000 5766 9070 9801 9954 9985
100000 31793 74618 92344 98058 99340
1000000 31793 74619 92345 98059 99341


Zipf's diagram for sentence endings


Gnuplot diagram

5770 msec needed at 2018-03-17 14:57